ICM 2002 ■ Vol. Ill ■ 1-3 



Diophantine Methods for 
Exponential Sums, and Exponential 
Sums for Diophantine Problems 



Trevor D. Wooley* 
Abstract 

Recent developments in the theory and application of the Hardy- 
Littlewood method are discussed, concentrating on aspects associated with 
diagonal diophantine problems. Recent efficient differencing methods for esti- 
mating mean values of exponential sums are described first, concentrating on 
developments involving smooth Weyl sums. Next, arithmetic variants of clas- 
sical inequalities of Bessel and Cauchy-Schwarz are discussed. Finally, some 
emerging connections between the circle method and arithmetic geometry are 
mentioned. 
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1. Introduction 

Over the past fifteen years or so, the Hardy-Littlewood method has experi- 
enced a renaissance that has left virtually no facet untouched in its application to 
diophantine problems. Our purpose in this paper is to sketch what might be termed 
the past, present, and future of these developments, concentrating on aspects as- 
sociated with diagonal diophantine problems, and stressing modern developments 
that make increasing use of less traditional diophantine input within ambient ana- 
lytic methods. We avoid discussion of the Kloosterman method and its important 
recent variants (see [5] and [8]), because the underlying ideas seem inherently con- 
strained to quadratic, and occasionally cubic, diophantine problems. Our account 
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begins with a brief introduction to the Hardy-Littlewood (circle) method, using 
Waring's problem as the basic example. The discussion here illustrates well the 
issues involved in the analysis of systems of diagonal equations over arbitrary alge- 
braic extensions of Q, and motivates that associated with more general systems of 
homogeneous equations (see [1] and [14]). 

Let s and k be natural numbers with s > fc > 2, and consider an integer n 
sufficiently large in terms of s and k. The circle method employs Fourier analysis 
in order to obtain asymptotic information concerning the number, R(n) = Rg^kin), 
of integral solutions of the equation + ■ ■ ■ + = n. Write P = n^/^ and define 
the exponential sum f{a) = /{a; P) by 

/(a)= e{ax''), 

1<X<P 

wherein e{z) denotes e'^'^". Then it follows from orthogonality that 

-^W = / f{aye{-na)da. 
Jo 

When a is well-approximated by rational numbers with small denominators, one 
has sharp asymptotic information concerning /(a). In order to be precise, let Q 
satisfy 1 < Q < \P'^/'^, and define the major arcs 9Jt = 9Jl((3) to be the union of 
the intervals 97l(g, a) = {a & [0, 1) : \qa - a\ < QP~''}, with < a < g < Q and 
(a, q) = 1. Also, put 

S{q,a)=Y,e{ar''/q) and v{p) = eipj'')dj, 

and define f*{a) for a G [0, 1) by taking /*(«) = q^^S{q, a)v(a—a/q), when a lies in 
3n(g,a) C ^ffl{Q), and otherwise by setting /*(«) = 0. Then the sharpest available 
estimate (see Theorem 4.1 of [16]) establishes that^ /(a) = f*{a) + 0{Q'^l'^+''), 
uniformly for a G dK{Q). The fimctions S{q^ a) and v{(3) are rather well-imderstood, 
and thus one deduces that whenever s > max{4, fc -I- 1} and Q < P, then 

/ f{are{-na)da = £il±i/^e«,fe(n)n^/'=-i + 0{n^/''-'-% (1.1) 

for a suitable positive number 5. Here, the F-function is that familiar from classical 
analysis, and the singular series (3s,fc(n) is equal to the product of p-adic densities 
rip 'fp('^)7 where for each prime p we write 

Vp{n) = lim /(i-'^^cardjx e {Z/p'^Zy : a;^ + • • • + a;^ = n (mod /)}. 

h—^oo 

■'"Given a complex- valued function f(t) and positive function g{t), we use Vinogradov's 
notation f{t) <§i g{t), or Landau's notation f{t) = 0{g{t)), to mean that when t is large, there is 
a positive number C for which f{t) < Cg{t). Similarly, we write f{t) ^ g{t) when g{t) <C f{t), 
and f{t) X g{t) when /(t) <C g{t) ^ /(t). Also, we write f{t) = o{g{t)) when as t ^ oo, one 
has f{t)/g{t) 0. Finally, we use the convention that whenever e occurs in a formula, then it is 
asserted that the statement holds for each fixed positive number e. 
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An asymptotic formula for R{n), with leading term determined by the major 
arc contribution (1.1), now follows provided that the corresponding contribution 
arising from the minor arcs m = [0, 1) \ is asymptotically smaller. Although such 
is conjectured to hold as soon as s > max{4, k + 1}, this is currently known only 
for larger values of ,s. It is here that energy is focused in current research. One 
typically estimates the minor arc contribution via an inequality of the type 

/ f{aYe{—na)da 

J m 

For suitable choices of t and Q, one now seeks bounds of the shape 

sup |/(a)| « P^-^+^ and ^ \f{a)f*da ^ P^*-''+^+' , (1.3) 

with T > and 6 small enough that (s — 2t)T > 5. The right hand side of (1.2) is 
then o{n^^''~^), which is smaller than the main term of (1.1) whenever &s^k{n) ^ 1. 
The latter is assured provided that non-singular p-adic solutions can be found for 
each prime p, and in any case when s > Ak. Classically, one has two apparently 
incompatible approaches toward establishing the estimates (1.3). On one side is 
the differencing approach introduced by Weyl [23], and pursued by Hua [9], that 
yields an asymptotic formula for R{n) whenever s > 2'^ + 1. The ideas introduced 
by Vinogradov [21], meanwhile, provide the desired asymptotic formula when s > 
Ck^ log k, for a suitable positive constant C. 



< sup |/(a)| 



\f{a)rda 



(1.2) 



2. Efficient differencing and smooth Weyl sums 

Since the seminal work of Vaughan [15], progress on diagonal diophantine 
problems has been based, almost exclusively, on the use of smooth numbers, by 
which we mean integers free of large prime factors. In brief, one seeks serviceable 
substitutes for the estimates (1.3) with the underlying summands restricted to be 
smooth, the hope being that this restriction might lead to sharper bounds. Before 
describing the kind of conclusions now available, we must introduce some notation. 
Let A{P, R) denote the set of natural numbers not exceeding P, all of whose prime 
divisors are at most R, and define the associated exponential sum h{a) = h{a; P, R) 
by 

h{a;P,R)= ^ e{ax''). 

xeA{P,R) 

When <; is a positive integer, we consider the mean value StiP,R) = \hia)\^*da, 
which, by orthogonality, is equal to the number of solutions of the diophantine 
equation x'^ + ■ ■ ■ + = Ui + ■ ■■ + Vt, with Xi, yi G A{P, R) {I < i < t). We 
take i? X P'' in the ensuing discussion, with r/ a small positive number^. In these 
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circumstances one has card(^(P, i?)) ~ c{r])P, where the positive number c(r/) is 
given by the Dickman function, and it follows that St(P,R) S> P* + P^t-fe j^- jg 
conjectured that in fact S't(P, R) + We refer to the exponent A( as 

being permissible when, for each e > 0, there exists a positive number r] = 'q{t, k, e) 
with the property that whenever R < P"^, one has Si{P, R) <C P'^'"'"'^. One expects 
that the exponent At = max{t, 2t — k} should be permissible, and with this in mind 
we say that 5t is an associated exponent when = t + St is permissible, and that 
At is an admissible exponent when At = 2t — fc + Aj is permissible. 

The computations required to determine sharp permissible exponents for a 
specific value of k are substantial (see [20]), but for larger k one may summarise some 
general features of these exponents. First, for < t < 2 and fc > 2, it is essentially 
classical that the exponent (5f = is associated, and recent work of Heath-Brown 
[6] provides the same conclusion also when t = 3 and k > 238,607,918. When 
t = o{Vk), one finds that associated exponents exhibit quasi- diagonal behaviour, 
and satisfy the property that (5t as A: — > oo. To be precise, Theorem 1.3 of [28] 
shows that whenever fc > 3 and 2 <t < 2e~^k^/'^, then the exponent 

4/ci/2 / 4fc A 

-375 ' (2-1) 



et e^tV 

is associated. For larger t, methods based on repeated efficient differencing yield the 
sharpest estimates. Thus, the corollary to Theorem 2.1 of [26] establishes that for 
fc > 4, an admissible exponent At is given by the positive solution of the equation 
^^gAt/fe _ ^gi-2t//s_ rpj^g exponent \t = 2t - k + fce^"^*/'^ is therefore always per- 
missible. Previous to repeated efficient differencing, analogues of these permissible 
exponents had a term of size fce^*/'' in place of fce^^^*/*^ (sec [15]), so that in a 
sense, the modern theory is twice as powerful as that available hitherto. 

The above discussion provides a useable analogue of the mean-value estimate 
in (1.3). We turn next to localised minor arc estimates. Take Q = P, and define m 
as in the introduction. Suppose that s, t and w are parameters with 2s > A: + 1 for 
which As, Af and A^ are admissible exponents, and define 

fc-At-A,A,„ 



2{s{k -h A^ - At) + tw(l + A^)) ' 



Then Corollary 1 to Theorem 4.2 of [27] shows that sup^g^ |ft.(a)| < pi-T{k)+e^ 
for large k this estimate holds with a{k)~^ = k{\ogk + O(loglogfc)). Applying an 
analogue of (1.2) with h in place of /, and taking^ t = [ifc(log fc + loglog/c + 1)] and 
s = 2t-|-fc-|- [Afcloglogfc/logfc], for a suitable A > 0, we deduce from our discussion 
of permissible exponents that J^h{a)'^e{—na)da = o{n'^/'^~^). By considering the 
representations of a given integer n with all of the fcth powers P-smooth, it is now 



^We write [z] to denote max{n 6 Z : n < z}. 
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apparent that a modification of the argument sketched in the introduction shows 
that R{n) ^ &s,k{n)n^^''~^ as soon as one confirms that 



Sharp asymptotic information concerning h{a) is available throughout 0Jl((5) only 
when (5 is a small power of log P, and so the proof of (2.2) involves pruning technol- 
ogy. Such machinery, in this case designed to estimate the contribution from a set 
of the shape fM{P) \ 9Jl((logP)'^), has evolved into a powerful tool. Such issues can 
be handled these days with a number of variables barely exceeding max{4, fc + 1}. 

This approach leads to the best known upper bounds on the function G{k) in 
Waring's problem, defined to be the least integer r for which all sufficiently large 
natural numbers arc the sum of at most r positive integral fcth powers. 

Theorem 2.1. One has G{k) < fc(log + loglog + 2 + 0(log log fc/ logfc)). 

This upper bound (Theorem 1.4 of [27]) refines an earlier one of asymptotically 
similar strength (Corollary 1.2.1 of [24]) that gave the first sizeable improvement of 
Vinogradov's celebrated bound G{k) < (2 + o(l))fc log A:, dating from 1959 (see [22]). 
Aside from Linnik's bound G{3) < 7 (see [11]), all of the sharpest known bounds on 
G(k) for smaller k arc established using variants of these methods. Thus one has 
G*(4) < 12 (see [15], and here the ^ denotes that there are congruence conditions 



modulo 16), G(5) < 17, G(6) < 24, G(7) < 33, G(8) < 42, G(9) < 50, G(10) < 59, 
G(ll) < 67, G(12) < 76, G(13) < 84, G(14) < 92, G(15) < 100, G(16) < 109, 
G(17) < 117, G(18) < 125, G(19) < 134, G(20) < 142 (see [17], [18], [19], [20]). 



Unfortunately, shortage of space obstructs any but the crudest account of 
the ideas underlying the proof of the mean value estimates that supply the above 
permissible exponents. The use of exponential sums over smooth numbers occurs 
already in work of Linnik and Karatsuba (see [10]), but only with Vaughan's new 
iterative method [15] is a flexible homogeneous approach established. An alternative 
formulation suitable for repeated efficient differencing is introduced by the author 
in [24]. Suppose that the exponent is permissible, and consider a polynomial 
tjj e Z[t] of degree d > 2. Given positive numbers M and T with M < T, and an 
element x G A{T,R) with x > M, there exists an integer m with m G [M,MR] 
for which m\x. Consequently, by applying a fundamental lemma of combinatorial 
flavour, one may bound the number of integral solutions of the equation 



with 1 < z,w < P and Xi,yi € A{T,R) {1 < i < s), in terms of the number of 
integral solutions of the equation 





s 




(2.3) 



S 




(2.4) 
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with 1 < z,w < P, M < m < MR, {z)tp' (w) , m) = 1 and u^,v, e A{T/M,R) 
(1 < ? < ,s). The imphcit congruence condition ipi^) = '4>{w) (mod rn^) may 
be analytically refined to the stronger one z = w (mod m^), and in this way 
one is led to replace the expression il){z) — i(){w) by the difference polynomial 
■0i(z;ft,;m) = m~''{tp{z + hm'') — V'(^))- Notice that when M > P^/^, one is 
forced to conclude that z = w, and then the number of solutions of (2.4) is bounded 
above by PMRSs{T/M,R) < P'^+^M{T/M)^'' . Otherwise, following an applica- 
tion of Schwarz's inequality to the associated mean value of exponential sums, one 
may recover an equation of the shape (2.3) in which tp{z) is replaced by ipi{z), and 
T is replaced by T/M, and repeat the process once again. This gives a repeated 
differencing process that hybridises that of Weyl with the ideas of Vinogradov. 

It is now possible to describe a strategy for bounding a permissible expo- 
nent Xs+i in terms of a known permissible exponent Ag. We initially take T = P 
and i}{z) = z'^, and observe that Ss+i{P,R) is bounded above by the number of 
solutions of (2.3). We apply the above efficient differencing process successively 
with appropriate choices for M at each stage, say M = P'^\ with < (f>i < 1/k, 
for the ith differencing operation. After some number of steps, say j, we take 
<pj = 1/k in order to force the above diagonal situation that is easily estimated. 
One then optimises choices for the (pi in order to extract the sharpest upper bound 
for Ss+i{P, R), and this in turn yields a permissible exponent A^+i. It transpires 
that in this simplified treatment, successive admissible exponents are related by 
the formula A^+i = A,,(l — 4>) + k(f> — 1, wherein one may take (f> very close to 
1/(A;-|- As). Thus one finds that Ag+i is essentially As(l — 2/(fc + Ag)), an observa- 
tion that goes some way to explaining how it is that this method is about twice as 
strong as previous approaches that would correspond to choices of (j) close to 1/k. 

Refined versions of this differencing process make use of all known permissible 
exponents Ag in order to estimate a particular exponent At, and in such circum- 
stances the process becomes highly iterative, and entails significant computation. 
Such variants make use of refined Weyl estimates for difference polynomials, and es- 
timates for the number of integral points on curves and surfaces (see [20]). Variants 
of these methods apply also in the situation of Vinogradov's mean value theorem 
(see [25]), smooth Weyl sums with polynomial arguments (see [29]), and even for 
sums relevant to counting rational lines on hypersurfaces (see [12]). 

Frequent reference to underlying diophantine equations seems to limit these 
methods to estimating even moments of smooth Weyl sums, and until recently 
fractional moments could be estimated only by applying Holder's inequality to in- 
terpolate linearly between permissible exponents. However, a method [28] is now 
available that permits fractional moments to be estimated non-trivially, thereby 
"breaking classical convexity" , and moreover the number of variables being differ- 
enced need not even be an integer. These new estimates can be applied to sharpen 
permissible exponents (with integral argument), and indeed the associated exponent 
(2.1) is established in this way. Another consequence [32] of these developments is 
the best available lower bound for N{X), which we define to be the number of 
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integers not exceeding X that are represented as the sum of throe positive integral 
cubes. One has N{X) > where ^ = (\/2833 - 43)/41 = 0.24941301 . . . 

arises from the permissible exponent A3 = 3 + ^ for fc = 3. Earlier, Vaughan [15] 
obtained an estimate of the latter type with 13/4 in place of 3 + ^. 



3. Arithmetic variants of Bessel's inequality 

Already in our opening paragraph we alluded to some of the applications 

accessible to the methods of §2. We now turn to less obvious applications that 
have experienced recent progress. We illustrate ideas once again with a simple 
example, and consider the set 2{N) of integers n, with N/2 < n < N, that are not 
represented as the sum of s positive integral fcth powers. The standard approach to 
estimating Z{N) = card(Z(A^)) is via Bessel's inequality. We now take P = N^/^. 
When 05 C [0,1), write i?*(n;*B) = j^h(o.Y e{—na)da, and write also R*(n) = 
R*{n; [0, 1)). The theory of §2 ensures that when Q is a sufficiently small power 
of logP, and s > 4fc, then i?*(ri; SDT) x n''/'^~^. Under such circumstances, an 
application of Bessel's inequality reveals that Z{N) is bounded above by 



E 

N/2<n<N 



R*{n) - R*{n;m) 



R*{n;m) 



s/k-l\ 



h{aye{—na)da 



< {N 



s/k- 



-i)-2 / \h{a)f'da. 

J m 



(3.1) 



When s > ^fc(log fc+loglogfc+2+o(l)), the minor arc integral in (3.1) is o(A^^''/*'^^), 
and thus it follows that Z{N) = o{N). Thus one may conclude that almost all 
integers are sums of s ~ (^ + o(l))fclogfc positive integral A;th powers. 

The application of Bessel's inequality in (3.1) makes inefficient use of underly- 
ing arithmetic information, and fails, for example, to effectively estimate the number 
of values of a polynomial sequence not represented in some prescribed form. Sup- 
pose instead that we define a Fourier series over the exceptional set itself, namely 
K{a) — '^„e{na), where the summation is over n G Z(N). Since R*{n) = for 
n e Z{N), one has i?*(n;m) = —R*{n;dJl), and thus we see that 

iV«/fe-i^(iV) < / h{ayK{-a)da= [ h{ayK{-a)da 
Jm Jm 
Applying Schwarz's inequality in combination with Parseval's identity, we recover 
the previous consequence of Bessel's inequality via the bound 

1/2 / ^ X 1/2 



m 



h{ayK{-a)da 



< 



\K{a)\^da 



\h{a)\^'da 



(3.2) 



However, this formulation permits alternate applications of Schwarz's inequality or 
Holder's inequality. For example, the left hand side of (3.2) is bounded above by 

1/2 / ^ \ 1/2 



\h{ay*K{ay\da 



\h{a)\' 



'da 



(3.3) 
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and also by 

IKia^da^ \h{a)\'^/'da^ ^'^ . (3.4) 

In either case, the diophantine equations underlying the integrals on the left hand 
sides of (3.3) and (3.4) contain arithmetic information that can be effectively ex- 
ploited whenever the set Z{N) is reasonably thin. 

The strategy sketched above has been exploited by Briidern, Kawada and Woo- 
ley in a series of papers devoted to additive representation of polynomial sequences. 
Typical of the kind of results now available is the conclusion [3] that almost all val- 
ues of a given integral cubic polynomial arc the sum of six positive integral cubes. 
Also, Wooley [30], [31], has derived improved (slim) exceptional set estimates in 
Waring's problem when excess variables are available. For example, write E{N) for 
the number of integers n, with 1 < n < A'^, for which the anticipated asymptotic 
formula fails to hold for the number of representations of an integer as the sum of a 
square and five cubes of natural numbers. Then in [31] it is shown that E{N) <C A''^. 

As a final illustration of such ideas, we highlight an application to the solubility 
of pairs of diagonal cubic equations. Fix fc = 3, define h{a) as in §2, and put 
c(n) = Jq \h{a)\^ e{—na)da for each n e N. Briidern and Wooley [4] have applied 
the ideas sketched above to estimate the frequency with which large values of |c(n)] 
occur, and thereby have shown that, with ^ defined as in the previous section, 



c{x^-y^)\^= f t \h{afh{l3fh{a + I3f\dad(3 ^ P'^+^+' . 
Jo Jo 



x,yeA(P,R) 



On noting that 6 + ^ < 6.25, cognoscenti will recognise that this twelfth moment 
of smooth Weyl sums, in combination with a classical exponential sum equipped 
with Weyl's inequality, permits the discussion of pairs of diagonal cubic equations 
in 13 variables via the circle method. The exponent 6 + ^ improves an exponent 
6 + 2^ previously available for a (different) twelfth moment. Briidern and Wooley 
[4] establish the following conclusion. 

Theorem 3.1. Suppose that s > 13, and that ai,bi (1 < i < s) are fixed 
integers. Then the Hasse principle holds for the pair of equations 

aixl H h asxl = hxl H h bgxl = 0. 

The condition s > 13 improves on the previous bound s > 14 due to Briidern 
[2], and achieves the theoretical limit of the circle method for this problem. 



4. Arithmetic geometry via descent 

Let F{x) e Z[xi, . . . ,Xs\ be a homogeneous polynomial of degree d, and 
consider the number, N{B), of integral zeros of the equation F{x.) = 0, with 
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X g [—B, BY . When s is sufficiently large in terms of d, the circle method shows un- 
der modest geometric conditions that N{B) is asymptotic to the expected product 
of local densities. For fairly general polynomials, the condition on s is as severe as 
s > {d— 1)2'*, though for diagonal equations the methods of §2 relax this condition 
to s > (1 + o(l))(ilog(i. However, there is a class of varieties with small dimen- 
sion relative to degree, for which the circle method supplies non-trivial information 
concerning the density of rational points. The idea is to apply a descent process 
in order to interpret points on the original variety in terms of corresponding points 
on a new variety, with higher dimension relative to degree, more amenable to the 
circle method. 

To illustrate this principle, consider a field extension ii" of Q of degree n with 

associated norm form A'^(x) G Q[a;i, . . . ,Xn]. Also, let / and k be natural numbers 
with {k,l) = 1, and let a be a non-zero rational number. Then Heath-Brown and 
Skorobogatov [7] descend from the variety t'(l — t)'' = aA^(x) to the associated 
variety aN{u) + bN{v) = 2", for suitable integers a and b. The circle; nicithod 
establishes weak approximation for the latter variety, and thereby it is shown that 
the Brauer-Manin obstruction is the only possible obstruction to the Hasse principle 
and weak approximation on any smooth projective model of the former variety. 
One can artificially construct further examples amenable to the circle method. For 
example, if we take linearly independent linear forms Li(x) G Q[xi, . . . ,Xn] (1 < 
i < n + r), then one can establish non-trivial lower bounds for the density of 
rational points on the variety z*^ = Li(x) . . . L„+r(x) by descending to a variety 
that resembles a system of r diagonal forms of degree k, with constrained varying 
coefficients. The investigation of such matters will likely provide an active area 
of research into the future. In this context we point to work of Peyre [13] , which 
addresses the interaction between descent and the circle method in some generality. 
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